Adaptive Generalized Fused-Lasso: Asymptotic Properties and Applications
نویسندگان
چکیده
The Lasso has been widely studied and used in many applications over the last decade. It has also been extended in various directions in particular to ensure asymptotic oracle properties through adaptive weights (Zou, 2006). Another direction has been to incorporate additional knowledge within the penalty to account for some structure among features. Among such strategies the Fused-Lasso (Tibshirani et al., 2005) has recently been extended to penalize differences of coefficients corresponding to features organized along a network, through the Generalized Fused-Lasso. In this work we investigate the theoretical and empirical properties of the Adaptive Generalized Fused-Lasso in the context of Generalized Linear Models, with emphasis on Logistic Regression. More precisely, we establish its asymptotic oracle properties and propose an extensive simulation study to explore its empirical properties. We especially show that it compares favorably with other strategies. We also propose an adaptation of the Relaxed Lasso (Meinshausen, 2007). Finally we present an original application of the Generalized Fused-Lasso to the Joint Modeling framework where the design itself suggests the graph to be used in the penalty; an illustration is provided on road safety data. The views and opinions expressed herein are those of the author and do not necessarily reflect the views of Novartis. 1 ha l-0 08 13 28 1, v er si on 1 16 A pr 2 01 3
منابع مشابه
On the robustness of the generalized fused lasso to prior specifications
Using networks as prior knowledge to guide model selection is a way to reach structured sparsity. In particular, the fused lasso that was originally designed to penalize differences of coefficients corresponding to successive features has been generalized to handle features whose effects are structured according to a given network. As any prior information, the network provided in the penalty m...
متن کاملSome Properties of Generalized Fused Lasso and Its Applications to High Dimensional Data
Identifying homogeneous subgroups of variables can be challenging in high dimensional data analysis with highly correlated predictors. The generalized fused lasso has been proposed to simultaneously select correlated variables and identify them as predictive clusters. In this article, we study several properties of generalized fused lasso. First, we present a geometric interpretation of the gen...
متن کاملProperties and Refinements of The Fused Lasso
We consider estimating an unknown signal, which is both blocky and sparse, corrupted by additive noise. We study three interrelated least squares procedures and their asymptotic properties. The first procedure is the fused lasso, put forward by Friedman et al. (2007), which we modify into a different estimator, called the fused adaptive lasso, with better properties. The other two estimators we...
متن کاملSplit Bregman method for large scale fused Lasso
Abstract: Ordering of regression or classification coefficients occurs in many real-world applications. Fused Lasso exploits this ordering by explicitly regularizing the differences between neighboring coefficients through an l1 norm regularizer. However, due to nonseparability and nonsmoothness of the regularization term, solving the fused Lasso problem is computationally demanding. Existing s...
متن کاملFused Adaptive Lasso for Spatial and Temporal Quantile Function Estimation
Quantile functions are important in characterizing the entire probability distribution of a random variable, especially when the tail of a skewed distribution is of interest. This article introduces new quantile function estimators for spatial and temporal data with a fused adaptive Lasso penalty to accommodate the dependence in space and time. This method penalizes the difference among neighbo...
متن کامل